Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Lessons from the Palestine Post Project

Identifieur interne : 002121 ( Main/Exploration ); précédent : 002120; suivant : 002122

Lessons from the Palestine Post Project

Auteurs : Ronald W. Zweig

Source :

RBID : ISTEX:04F7AB691415E05200E93B53C2011C8449340242

Abstract

The digitization of an entire run of a historic newspaper has createda resource for researchers that opens up the wealth of information contained in a daily newspaper. By using a technology that combines full text retrieval with high resolution images of the original, this project provides the best of both worlds in information retrieval and document delivery. The project has solved the problem of accessibilityof difficult-to-use, rare and voluminous source material. However, bymaking 40,000 pages of broadsheet newsprint instantly accessible, we have generated an entirely new problem. A surfeit of information has been created. While it is possible to be nourished by a stream, it isimpossible to drink when the floodgates of the dam have been opened. New techniques of information retrieval and understanding will be required before we can approaches in these directions are suggested in the article.

Url:
DOI: 10.1093/llc/13.2.89


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Lessons from the Palestine Post Project</title>
<author wicri:is="90%">
<name sortKey="Zweig, Ronald W" sort="Zweig, Ronald W" uniqKey="Zweig R" first="Ronald W." last="Zweig">Ronald W. Zweig</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:04F7AB691415E05200E93B53C2011C8449340242</idno>
<date when="1998" year="1998">1998</date>
<idno type="doi">10.1093/llc/13.2.89</idno>
<idno type="url">https://api.istex.fr/document/04F7AB691415E05200E93B53C2011C8449340242/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000195</idno>
<idno type="wicri:Area/Istex/Curation">000192</idno>
<idno type="wicri:Area/Istex/Checkpoint">001625</idno>
<idno type="wicri:doubleKey">0268-1145:1998:Zweig R:lessons:from:the</idno>
<idno type="wicri:Area/Main/Merge">002238</idno>
<idno type="wicri:Area/Main/Curation">002121</idno>
<idno type="wicri:Area/Main/Exploration">002121</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Lessons from the Palestine Post Project</title>
<author wicri:is="90%">
<name sortKey="Zweig, Ronald W" sort="Zweig, Ronald W" uniqKey="Zweig R" first="Ronald W." last="Zweig">Ronald W. Zweig</name>
<affiliation>
<wicri:noCountry code="subField">Isrsel</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="1998-06">1998-06</date>
<biblScope unit="volume">13</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="89">89</biblScope>
<biblScope unit="page" to="94">94</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">04F7AB691415E05200E93B53C2011C8449340242</idno>
<idno type="DOI">10.1093/llc/13.2.89</idno>
<idno type="ArticleID">13.2.89</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract">The digitization of an entire run of a historic newspaper has createda resource for researchers that opens up the wealth of information contained in a daily newspaper. By using a technology that combines full text retrieval with high resolution images of the original, this project provides the best of both worlds in information retrieval and document delivery. The project has solved the problem of accessibilityof difficult-to-use, rare and voluminous source material. However, bymaking 40,000 pages of broadsheet newsprint instantly accessible, we have generated an entirely new problem. A surfeit of information has been created. While it is possible to be nourished by a stream, it isimpossible to drink when the floodgates of the dam have been opened. New techniques of information retrieval and understanding will be required before we can approaches in these directions are suggested in the article.</div>
</front>
</TEI>
<affiliations>
<list></list>
<tree>
<noCountry>
<name sortKey="Zweig, Ronald W" sort="Zweig, Ronald W" uniqKey="Zweig R" first="Ronald W." last="Zweig">Ronald W. Zweig</name>
</noCountry>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002121 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002121 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:04F7AB691415E05200E93B53C2011C8449340242
   |texte=   Lessons from the Palestine Post Project
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024